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For the exploration of large state spaces, symbolic search using binary decision diagrams (BDDs) 
can save huge amounts of memory and computation time. State sets are represented and modified by 
accessing and manipulating their characteristic functions. BDD partitioning is used to compute the 
image as the disjunction of smaller subimages. 

In this paper, we propose a novel BDD partitioning option. The partitioning is lexicographical in 
the binary representation of the states contained in the set that is represented by a BDD and uniform 
with respect to the number of states represented. The motivation of controlling the state set sizes in 
the partitioning is to eventually bridge the gap between explicit and symbolic search. 

Let n be the size of the binary state vector. We propose an 0(n) ranking and unranking scheme 
that supports negated edges and operates on top of precomputed satcount values. For the uniform 
split of a BDD, we then use unranking to provide paths along which we partition the BDDs. In a 
shared BDD representation the efforts are 0{n). The algorithms are fully integrated in the CUDD 
library and evaluated in strongly solving general game playing benchmarks. 

1 Introduction 

In this paper we are concerned with the space-efficient traversal of state spaces with explicit-state search 
frontiers that are too large to be kept in main memory. One of the options is a symbolic representation of 
state sets in form of characteristic functions, which are manipulated in the exploration. 

Following this approach, binary decision diagrams (BDDs) |6] have been shown to be an efficient 
data structure for representing and exploring large state sets in Model Checking [ 16] and Planning 0, 
For some domains they can save a tremendous amount of memory (wrt. an explicit representation). BDDs 
are also used as space-efficient construction pattern database search heuristics Q . 

General game playing (GGP) goes in a similar direction as automated action planning, but it can 
be seen to be even more general. In general game playing the agent, i.e., the player, has to handle any 
game that can be described by the used description language without intervention of a human, and also 
without the programmer knowing what game will be played - it has to come up with a good strategy on 
its own. The most used description mechanism is the game description language GDL [14J. A BDD 
classification algorithm for (strongly) solving general single- and two-player games has been proposed 



A rank is a number uniquely representing a state and the inverse process, called unranking, recon- 
structs the state given its rank. The approach advocated in this paper builds on top of findings of [8 ], who 
illustrated that ranking and unranking of states in a state set represented as a BDD is available in time 
linear to the length of the state vector (in binary representation). In other words, BDD ranking aims at 
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the symbolic equivalent of constructing a perfect hash function in explicit-state space search 0. For the 
construction of the perfect hash function, the underlying state set to be hashed is generated in advance in 
form of a BDD. This is plausible when computing strong solutions to problems, i.e., the game-theoretical 
value for each reachable state. Applications are, e.g., endgame databases, or planning tasks where the 
problem to be solved is harder than computing the reachability set. 

Perfect hash functions to efficiently rank and unrank states have been shown to be very successful in 
traversing single-player games, such as Rubik's Cube or the Pancake Problem Ifl3l , or two-player games 
like Awari [ 19 ]. They are also used for creating pattern databases [4]. The problem of the construction of 
perfect hash functions for algorithms like two-bit breadth-first search is that they are problem-dependent. 

BDD partitioning approaches have been proposed to address the so-called state-explosion problem, 
which refers to the observation that the size of a state space of a system tends to grow exponentially 
in the number of its variables. In this paper we indicate that lex-partitioning, short for lexicographical 
partitioning, can advance symbolic state space search. 

As our approach refines the image operation, it applies to most BDD exploration approaches in AI 
and beyond. We start with an introduction to BDDs, to symbolic search and to computing strong solutions 
in general games (Section [2]), as well as to the basic idea of partitioning BDDs (Section [3]). Next, we 
turn to ranking and unranking (Section |4]), extended to the setting where BDDs can have negated edges. 
Then, we consider the partitioning of a BDD into sub-BDDs of an equal number of satisfying assignments 
(Section[5]). Finally, we provide results in solving general games (Section[6]), discuss further implications 
and conclude (Section|7]). 

2 Binary Decision Diagrams for Strongly Solving Games 

BDDs are a memory-efficient data structure used to represent Boolean functions as well as to perform 
set-based search. In short, a BDD is a directed acyclic graph with one root and two terminal nodes, the 0- 
and the 1-sink. Each internal node corresponds to a binary variable and has two successors, one (along 
the Else-edge) representing that the current variable is false (0) and the other (along the Then-edge) 
representing that it is true (1). For any assignment of the variables derived from a path from the root to 
the 1-sink the represented function will be evaluated to 1. 

Bryant [5] proposed a fixed variable ordering, for which he also provided two reduction rules (elim- 
inating nodes with the same Then- and Else-successor and merging two nodes representing the same 
variable that share the same Then-successor as well as the same Else-successor). These BDDs are called 
reduced ordered binary decision diagrams (ROBDDs). Whenever we mention BDDs in this paper, we 
actually refer to ROBDDs. We also assume that the variable ordering is the same for all the BDDs and 
has been optimized prior to the search. 

BDDs have been shown to be very effective in the verification of hard- and software systems, where 
BDD traversal is referred to as symbolic model checking |[T6ll . Adopting terminology to state space 
search, we are interested in the image of a state set S with respect to a transition relation Trans. The 
result is a characteristic function of all states reachable from the states in S in one step. 

The image Succ of the state set S is computed as Succ(x') = 3x (Trans(x,x r ) A S(x)). The preimage, 
which determines all predecessors of the state set S, is computed as Pre{x) = 3x' (Trans(x,x') A S(x')). 

Using the image operator implementing a symbolic breadth-first search (BFS) is straight-forward. 
All we need to do is to repeatedly apply the image operator to the set of states reachable from the initial 
state found so far. The search ends when a fix-point is reached, i.e., when no new successor states can be 
found. We store the set of all reachable states as one BDD, so that, due to the structure of a BDD, this 
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does not contain any duplicates of states. 

In general game playing (GGP) we are concerned with the problem of automatically playing a game 
that the player probably has never seen before, which is very similar to the action planning. There are 
several differences, the first and foremost of course being that in GGP we are not restricted to only one 
player, but rather an arbitrary number of participants is supported. While in classical action planning the 
goal is to find a plan, i.e., a sequence of actions transforming the initial state to a goal state, as short as 
possible, in GGP each terminal state has a specific outcome for each participating player and the goal is 
to maximize the own outcome. 

Allis [ 1 ] proposed three kinds of solutions for two-player zero-sum games. In practice a weak solu- 
tion is often enough, as it allows the game to be played optimally in the sense that the player following 
the solution will never achieve an outcome worse than what was predicted for the initial state, inde- 
pendent of the moves the opponent chooses. A problem arises only when not following the solution at 
some step; in that case it might be that the game-theoretic value as well as the best move to take are not 
known for the successor state. In GGP this problem might arise when we first played following some 
heuristic that told us what to do and finished the calculation of the solution only afterward. That is why 
we chose to calculate strong solutions, which corresponds to finding the game-theoretic value for each 
reachable state, and thus to be able to determine the best move to take for every state that might ever be 
encountered. 

For the case of single-player games we might also speak about weak and strong solutions. In that 
case a weak solution corresponds to a plan that lets us reach the best possible outcome from the initial 
state, while a strong solution again tells us the best possible outcome for each reachable state, so that we 
can continue playing optimally even after a suboptimal move has been chosen. 

In order strongly solve single- or two-player games lfT2l . we find all the reachable states by perform- 
ing symbolic BFS, but instead of storing all reachable states in one BDD we store each layer separately. 
The solving starts in the last reached layer and performs regression search towards the initial state, which 
resides in layer 0. This final layer contains only terminal states (otherwise the forward search would have 
progressed further), which can be solved immediately by calculating the conjunction with the BDDs rep- 
resenting the rewards for the two players. Once this is done, the search continues in the preceding layer, 
because the remainder of the layer is empty. If another layer contains terminal states as well, those are 
solved in the same manner before continuing with the remaining states of that layer. The rewards are 
handled in a certain order (e.g., in the order win-draw-loss for the currently active player in case of 
a zero-sum game or in decreasing order in case of a single-player game). All the solved states of the 
successor layer are loaded in this order and the preimage is calculated, which results in those states of 
the current layer that will achieve the same rewards and are thus solved. 

3 Partitioning 

For several domains constructing a transition relation Trans prior to the search consumes huge amounts 
of the available computational resources. Fortunately, it is not required to build Trans monolithically, 
i.e., as one big relation. 

Provided a set of actions &f, we can partition Trans into individual transition relations Trans a for each 
action a £ s.t. Trans = \J ' aes ^ Trans a . For such a disjunctive partitioning of the transition relation the 
image now reads as 
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This image computation applies disjunctive splits for the different actions to be applied and can accelerate 
BDD exploration compared to a monolithical representation. One reason is that the relational product 
for computing an image results in many intermediate BDDs and reveals an NP hard problem (3-SAT can 
be reduced to an image operation lfl"5l ). The execution sequence of the disjunction has an effect on the 
overall running time. In this case, we organize the partitioned image in form of a binary tree, trying to 
have intermediate BDDs of similar size. 

A partitioning of S into k disjoint sets S\,...,Sk (St H Sj = for i / j) can lead to further simplified 
sub-images, so that we have 

Succ(x) = V V (Trans a (x,x) A S/(jc))) • 

\<l<kae.stf 

Our partitioning method refines the following notion of a partitioned BDD. 

Definition 1 (Partitioned BDD lfT8lD . Given a Boolean function f: {0, 1}" — > {0, 1}, defined over n inputs 
X n = {x\,. . . ,x n }, the partitioned BDD representation is a set of k function pairs (w\,fi),. . . , (wk,fk), 
where Wi,f : {0, 1}" — > {0, 1} are also defined overX n and satisfy the following four conditions. 

1. Wi and f are represented as BDDs respecting the same variable ordering as f, for 1 < i < k. 

2. w\ V . . . Vwk = 1- 

3. Wi A Wj = 0, for all i ^ j. 

4. fi = Wif\fifor\<i<k. 

We refer to the lexicographical ordering of bitvectors by using the subindex lex: for a, b 6 {0, 1 }" we 
have a <\ ex b if there is an i € {1, ... ,n} such that a, < bt and for all j G { 1 1} we have aj = bj. 
Moreover, a <i ex biff a <i ex b or a = b. 

Definition 2 (Lex-Partitioned BDD). Given a Boolean function / : {0, 1}" — > {0, 1}, defined over n inputs 
X n = {jti, . . . ,x n }, the lex-partitioned BDD representation off is a set of k assignments a\ . . .,ak € {0, 1}" 
and k functions fi,-.-,fk '• {0, 1}" — )■ {0, 1} that are also defined over X n and satisfy the following 
conditions. 

1. f are represented as BDDs respecting the same variable ordering as f, for 1 <i <k. 

2. ak = (1, ■ ■• , 1) and, for all i < k, we have a, </ ev a !+ i. 

3. ./i fk f- 

4- fj) Oforalli/j. 

5- fl = f A Va< tofll a an d /i = / A V flf _, < lex a< lex a, «. for all 1 < i < k. 

Using coefficients w\ = Va< ; ai a an ^ w i = Va,_ 1 < ;fr <a< ; „a; afot I <i <k the definition specializes 
the one of partitioned BDDs. The advantage is that by the lexicographical ordering we obtain more 
control over the evolution of BDDs resulting from a split. 

4 Ranking and Unranking 

Linear-time ranking and unranking functions with BDDs have been given in (§1. Ranking is a minimal 
perfect hash function from the set of satisfying assignments to the position of it in the lexicographical 
ordering of all satisfying assignments. Unranking is the inverse operation to ranking. 
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1 precomputeSatCount () 

2 i = level ( root ) ; 

3 satcount = 2"i * satCountAux ( root ) ; 

1 satCountAux ( n) 

2 if (n == 1— sink()) return 1; 

3 if (n == 0— sink()) return 0; 

4 if (res = lookup(n)) return res; 

5 t = sign(n) * Then ( | n | ) ; e = sign(n) * Else(|n|); 

6 i = level(|n|); j = level(t); k = level(e); 

7 satcount = (2*(j— i— 1)) * satCountAux ( t ) + 

8 (2"(k-i-l)) * satCountAux(e); 

9 i n s ert (n , s atcount ) ; 
10 return satcount; 

Figure 1: Satisfiability Counting with Negated Edges. 

Definition 3 (Ranking and Unranking). The rank of an assignment a G {0, 1}" is the position in the 
lexicographical ordering of all satisfying assignments of the Boolean function f, while the unranking of 
a number r in {0, . . . ,Cf — 1} is its inverse, with Cf being the total number of satisfying assignments of 
f- 

We have implemented the pseudo-code algorithms for the CUDD library EH . The proposal in |[8l 
does not support negated edges. Negated edges, however, are crucial, since otherwise function comple- 
mentation is not a constant time operation, at least for a BDD in a shared representation ifTTl . 

Definition 4 (Edge Complementation, Satcount, Conversion). The index of a BDD node is its unique 
position in the shared representation. For the ease of notation we take the negation of the node index to 
represent edge complementation, i.e., —n is the negated and \n\ is the regular node index. The function 
sign(ra) returns —1 if the edge is complemented and 1 if it is not, variable^) returns the variable asso- 
ciated with n and level(ra) its position in the variable ordering. Moreover, we assume the 1-sink to have 
the node index 1 and the 0-sink to have the node index —1. Let Cf = \{a G {0, l}' 1 | f{a) = 1}| denote 
the number of satisfying assignments (satcount) of f. With bin (and invbinj we denote the conversion of 
the binary value of a bitvector (and the inverse operation). 

For ranking and unranking the satcount values are precomputed for every essential subfunction and 
stored in the unique table for the shared BDD. This table is used by two functions: insert(n,v) sets a 
value v for a node n in the unique table and lookup(n) retrieves it. Memory is allocated if a node is new. 

As BDDs are reduced, not all variables on a path are present, but need to be accounted for in the 
satcount procedure. Figure [T] shows the pseudo-code of the function that does not only compute the 
values but also stores all the intermediate results and follows the proposal of J6). We see that the time 
(and space) complexity is 0(|G/|), where \GA is the number of nodes of the BDD Gf representing /. 

With negation on edges there are subtle problems to be resolved for storing the satcount values. 
While the number of satisfiable paths for a node might fit into a computer word this is not necessarily 
true for the negated subfunction. Therefore, we allow up to two satisfiability values to be stored together 
with a node: one wrt. reaching it on a negated edge, and the other one wrt. reaching it on a non-negated 
edge. In contrast to standard satisfiability count implementations (as in CUDD) this way we ensure that 
only satcount values of at most c are stored, where c is the satcount value of the root node. E.g., in 
ConnectFour 7x6 with 85 binary variables (yielding 2 85 possible values), long integers are sufficient to 
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store intermediate satcount values, which are all smaller than the satcount value of the entire reachable 
set (4,531,985,219,092). 

Figures [2] and [3] extend the proposal of (8[ and show the ranking and unranking functions and thus 
realize an invertible minimal perfect hash function for / mapping an assignment se{0,l}"toa value r G 
{0, .. . ,Cf — 1}. The procedures determine the rank given a satisfying assignment and vice versa. They 
access the satcount values on the Else-successor of each node (adding for the ranking and subtracting 
for the unranking). Missing nodes (due to BDD reduction) have to be accounted for by their binary 
representation, i.e., gaps of / missing nodes are accounted for 2'. Edge complementation changes the 
sign of the node n and is progressed to evaluation of the sinks. While the ranking procedure is recursive 
the unranking procedure is not. Both procedures track the gap imposed by the distance in the levels of 
the current and the successor node 

1 rank ( s ) 

2 i = level ( root ) ; 

3 d = bin(s [0.. i -1]); 

4 return d* satCount ( root ) + rankAux ( root , s ) — 1; 

1 rankAux ( n , s ) 

2 if ( | n j == 1) return 1; 

3 t = sign(n) * Then ( | n | ) ; e = sign(n) * Else(|n|); 

4 i = level(|n|); j = level(e); k = level(t); 

5 if ( s [ i ] == 0) 

6 return bin ( s [ i + 1 . . j — 1]) * satCount(e) + rankAux ( e , s ) ; 

7 else 

8 return 2~(j— i— 1) * satCount(e) + 

9 bin ( s [ i + 1 . . k - 1]) * satCount(t) + rankAux ( t , s ) ; 

Figure 2: Ranking with Negated Edges. 

Once the satcount values have been precomputed, both functions require linear time 0(n), where n 
is the number of variables in the function represented in the BDD. Dietzfelbinger and Edelkamp provide 
invariances showing that the procedures work correctly (H. 

5 Splitting 

Given the BDD Gf and any assignment s G {0, 1}", the split function computes the BDDs G g and Gh 
with the satisfying sets S g = S f n {b G {0, 1}" | b < lex s} and S h = S f n {b G {0, 1}" | b > tex s}. If we 
choose the assignment as the result of unranking [Cf/2\ we get C g = \ Cf/2\ and Q = |~C//2] . 

Figure [4] shows the pseudo-code of the recursive split algorithm. The input is the state vector in form 
of an assignment along which the BDD should be split. The result consists of two BDDs: the left BDD 
represents all the assignments lexicographically smaller or equal than the selected assignment a and the 
right BDD all the others. The algorithm traverses the path imposed by the input vector s bottom-up. 
Whenever needed, it allocates new nodes. If a node already exists, no allocation takes place. Depending 
on the truth value of the bitvector position lev currently processed, we swap the attachment of sub-BDDs. 

Figure [5] shows how the algorithm works in a part of the BDD. Each node in the path represented by 
the assignment a = 01 1 ... is split into two, depending on the value of the assignment for the associated 
variable. 
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1 unrank(r) 



2 i = level(root); d = r / satCount ( root ) ; 

3 s [0. . i -1] = invbin (d); 

4 n = root ; 

5 while (|n| > 1) 

6 r = r % satCount(n); 

7 t = sign(n) * Then ( | n | ) ; e = sign(n) * Else(|n|); 

8 j = level(e); k = level(t); 

9 if (r < (2~(j-i-l) * s atCount ( e ) ) ) 

10 S [i] = 0; 

11 d = r / satCount(e); 

12 s[i+l..j -1] = invbin(d); 

13 n = e ; i = j ; 

14 else 

15 s[i] = 1; 

16 r = r - (2"(j-i-l) * s atCount ( e ) ) ; 

17 d = r / satCount ( t ) ; 

18 s[i+l..k-l] = invbin(d); 

19 n = t ; i = k; 



Figure 3: Unranking with Negated Edges. 

If the assignment is 0, the recursion is made over the Else-edge. The result of the recursion is set as 
the Else-edges of the left and right part, respectively. The Then-edge points to the 0-sink in the left part 
and to the Then-successor of the original node in the right part. In the example N\ is divided into TYf 
and Nf . All the assignments with V\ = 1 are considered in the right part, while the others are split in the 
recursion. Symmetric rules apply in case that the assignment is 1 (node Nj, in the example). 

The base case corresponds to the constant node, returning the 1-sink for the left part and the 0-sink 
for the right one, assigning s to the left part. Finally, if some node in the path is missing (due to the 
elimination rule of nodes with the same Then- and Else-successor), the algorithm still splits it into two 
following the same rules described above. In the example, and Nf* are the result of the split over a 
missing node in the original BDD with both edges pointing to N3. 

Theorem 1 (Time Complexity Split Function). In a shared BDD representation given the BDD Gf and 

an assignment a G {0, 1}" the split function computes the BDDs G g and Gh in 0{n) time. 

Proof. As at most n nodes are processed in post-order, the time complexity is immediate. Moreover, 
the BDDs that are constructed are reduced. All original nodes remain valid in the shared representation 
and each new node that is created in the bottom-up traversal is checked for applicability of the BDD 
reduction rules (by issuing a look-up in the unique table). □ 

Theorem 2 (Space Complexity Split Function). We have \G g \ < \G\ +n and \Gh\ < \G\ +n. In a shared 
BDD representation we also have \G g U Gh\ < |G| +2n. 

Proof. As at most 2n nodes are created in the shared representation the second result | G g U Gh \ < | G\ + 2n 
is immediate. For each individual function G g and Gh we have constructed at most n new nodes. If we 
extract a BDD from the shared representation we duplicate nodes from G that are shared between the 
two structures. □ 
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1 pair split(s) 



2 
3 


return split Aux ( root ,0 , s ) ; 




4 


pair splitAux(n, lev, s) 




5 


z = 0— s ink ( ) ; 




6 


if (lev < level ( j n j ) ) 




7 


(sl,s2) = split Aux (n , lev + 1 , s ) ; 




8 


if(s[lev]) 




9 


left = new node ( var ( lev ), n , s 1 ) ; 




10 


right = new node ( var ( lev ), z , s2 ) ; 




11 


else 




12 


left = new node ( var ( lev ), s 1 , z ) ; 




13 


right = new node ( var ( lev ) , s2 , n ) ; 




14 


else if (n == 1 or n == 0) 




15 


left = n; right = z; 




16 


else 




17 


t = sign(n) * Then ( | n | ) ; e = sign(n 


) * 


18 


if (s [lev ]) 




19 


(tl,t2) = splitAux ( t , lev + 1 , s ) ; 




20 


left = new node ( variable ( | n | ), tl 


,e); 


21 


right = new node ( variable ( | n | ), t2 


,z); 


22 


else 




23 


(el,e2) = split Aux ( e , lev + 1 , s ) ; 




24 


left = new node ( vari able ( | n | ), z , 


el); 


25 


right = new node ( variable ( | n | ), t , 


e2); 


26 


return (left, right); 





Figure 4: Splitting with Edge Complements. 




(a) Original BDD (b) Left BDD (c) Right BDD 



Figure 5: Example of the split algorithm. The original BDD is split into its left and right part with respect 
to the assignment 011... (Dashed lines represent Else-edges). 
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By applying repeated splits we can uniformly partition a BDD into k parts in time 0{kn). 

6 Experimental Evaluation 

We have implemented our algorithms by extending the CUDD package ||2H to support the different 
satcount procedures, ranking, unranking and various split options. The solver program is written in Java 
and connected to Fabio Somenzi's BDD package CUDD with the Java Native Interface (JNI). 

We performed the experiments on one core of a 64-bit desktop computer (model Intel(R) Xeon(R) 
CPU X3470 with 2.93 GHz) running Linux (Ubuntu). This computer is equipped with 8 GB main mem- 
ory and 8,192 KB cache. For the experiments there was no need to use virtual memory. We compiled the 
CUDD package using the GNU C++ compiler (gcc version 4.3 with option -03). 

We conducted experiments in different games provided in the general description language GDL 031. 
The selection of games indicates the generality of the approach: seven single -player games (8-Puzzle, 
Asteroids Parallel, Knight's Tour, Lightsout, Lightsout 2, Peg Solitaire, and Tpeg) and five two-player 
games (Catch a Mouse, CephalopodMicro, ConnectFour 5x5, NumberTicTacToe, and TicTacToe) For 
the description of the games and their implementations in GDL we refer to the commonly used GGP 
serveiQ We used a timeout of one hour for every experiment. 

For each game, the exploration is performed in two phases [12]. First, a breadth-first search generates 
the reachable states, organized in layers. Then, a backward exploration classifies the states in each layer 
according to their reward by computing the preimage of the classified sets of the next layer. In case 
the explorations are completed, the games are strongly solved, i.e., the game-theoretical value of each 
reachable state is computed. This amounts to a combined forward and backward exploration to compute 
the set of reachable states and to classify it into sets of different game-theoretical values. The number 
of backward images is usually greater than the number of forward images as different classification sets 
have to be computed by calling the image operator. 

We compare our partitioning method to others already implemented in the CUDD library. There are 
different strategies, e.g., splitting for balancing the number of states, for balancing the number of nodes, 
and other disjunctive subset algorithms. 

When balancing the number of states, it is possible to limit the number of states in each BDD by 
splitting the original BDD in as many parts as necessary {States) or to split the BDD in a fixed number of 
folds, all of them with the same number of states (FoldStates). To get partitions with the desired number 
of states we make use of our lexicographic partitioning (Lex). We do not report experiments with another 
state-selection strategy included in the CUDD library, given that there is not significant differences wrt. 
our lexicographic version. The main difference is that our version respect the lexicographic order, which 
may be an advantage when ranking/unranking states or when assigning states to different cores in a 
distributed version. 

In order to get partitions with balanced number of nodes we consider also limiting the maximum 
number of nodes in each BDD (Nodes) or splitting the BDD in a fixed number of parts with balanced 
number of nodes (FoldNodes). The CUDD library includes several algorithms that allow splitting a BDD 
according to the number of nodes: 

• Shortest Path (SPath): Procedure to subset the given BDD choosing the shortest paths (largest 
cubes) in the BDD. 

• Compress (Comp): Finds a dense subset using several techniques in series. It is more expensive 
than other subsetting procedures, but often produces better results. 

1 http : // ggpserver. general-game-playing.de 
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There are also other methods that allow for a disjunctive decomposition in two parts {Disj) according 
to different criteria: 

• Iterative (Ite): Iterated use of supersetting to obtain a factor of the given function. The two parts 
tend to be imbalanced. 

• Generation (Gen): generalizes the decomposition based on the cof actors with respect to one vari- 
able. 

• Variable selection (Var): Decomposes the BDD according to the value of a variable, chosen to 
minimize and balance the size of the resulting BDDs. 
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Table 1 : Number of computed layers by each partitioning scheme wrt. not applying any partitioning 



Table [T] shows a comparison of the number of layers explored using each partitioning before the 
timeout. In half of the domains all the partitioning versions are able to finish the exploration. In the 
other domains most partitions are close to the exploration without partitioning solving more than 90% of 
the layers, except the three versions without a bounded number of partitions (States Lex 100000, Nodes 
Shortest Path 10000 and Nodes Compress 10000). Fold States Lex 8 dominates the other partitioning 
methods in all the domains, being the only one able to compute more layers than the version without 
partitioning in one domain: Asteroids Parallel. 

Tables [2] and [3] show the relative time spent in solving each game and the maximum number of nodes 
per image, respectively, for each partitioning configuration wrt. the "None" partitioning version. For 
those games where some partitioning strategy did not finish due to a timeout, only layers solved by the 
algorithm and the "None" partitioning version were taken into account. 

In distributed or external memory settings the maximum number of nodes involved in a single image 
determines the memory needed. Thus, the results show that BDD partitioning can help to solve problems 
by reducing the memory requirements, at the cost of increasing the time spent. Versions splitting the 
BDDs in an unbounded number of parts achieve good memory reductions but they do not scale well, 
both when balancing the BDDs according to the number of states or according to the number of nodes. 



76 



Lex-Partitioning 





rotates 


States 


FNodes 


FNodes 


Nodes 


Nodes 


Disj 


Disj 


Disj 




Lex 


Lex 


Comp 


Sratn 


Comp 


oratn 


var 


ite 


Oen 




Q 
O 


1 nnnnn 
1 uuuuu 


Q 
O 


Q 
O 


1 nnnn 

1UUUU 


1 nnnn 

1UUUU 








o 1 

8-puzzle 


1.1 / 


1 (\A 
V .U4 


1 77 
V.I 1 


1 A'X 
1.4 j 


1 7^ 

l.Zj 


1 m 

l.U/ 


1 Q1 

i.yi 


111 
i.ii 


Z. 1 J 


ConnectFour 5x5 


1 AA 
1.44 


1 

V.j 1 


"X OA 
J.Z4 


7 78 
Z.Zo 


7 77 
/ . /Z 


^ 1A 
J. /4 


7 Q7 
L.y 1 


1 ^n 

1 . JU 


1 7<N 

J. /J 


Knights Tour 


1 "XI 
V.J 1 


1 A(\ 
V .4U 


z.uu 


1 Q7 
V.yZ 


^ At 
J.4J 


A 74 
4.Z4 


7 on 
z.yu 


1 At 
1.4 J 


i n^ 

J.UJ 


Lightsout 


1.Z1 


1 TX 
V.Zj 


l.JU 


1 17 
l.JZ 


7 (Sk 
Z.OJ 


7 7n 
z.zu 


1 ^7 

1. J / 


1 Ifs 
V .ZD 


7 Al 
Z.4 / 


reg 


1 (YX 

i.uj 


1 OA 
l.Uo 


1 C\A 
1.U4 


1 n<; 
I.Uj 


111 
1.1 J 


1 no 


1 Ci< 

I.Uj 


1 (YX 
I.UJ 


1 (YX 
I.Uj 


.rVsLCIUlLls ralallCl 


n fin 


n ^ 

VJ.JJ 


U.7J 


n Q7 


n 1 8 


n 1Q 

U.17 


n ^n 

VJ.JVJ 


u.UU 


n if, 

yj. i u 


Lightsout 2 


1.22 


1.23 


1.56 


1.32 


2.65 


2.21 


1.57 


1.26 


2.46 


TPeg 


1.02 


1.04 


1.02 


0.98 


1.10 


1.08 


1.04 


1.01 


1.03 


Catena Mouse 


1.71 


166.53 


1.73 


1.29 


1.21 


1.06 


2.22 


1.01 


1.79 


Cephalopod Micro 


1.03 


1.21 


1.05 


1.07 


1.21 


1.17 


1.05 


1.03 


1.10 


Number TicTacToe 


1.18 


1.20 


1.71 


1.89 


2.38 


2.55 


1.55 


1.42 


2.16 


TicTacToe 


1.84 


3.30 


1.88 


1.65 


2.71 


2.55 


1.73 


1.34 


2.96 


Total 


0.99 


1.23 


1.12 


1.09 


1.15 


1.08 


1.01 


0.97 


1.19 



Table 2: Relative time spent in solving each game for each partitioning scheme wrt. not applying any 
partitioning 
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Table 3: Relative maximum number of nodes per image for each partitioning scheme wrt. not applying 
any partitioning 
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On the other hand, versions applying a partitioning in 2 or 8 folds do not impose a large overhead, being 
around 50 percent slower than the no-partitioning version. 

When comparing the way to select subsets, in general the lexicographic partitioning allows great 
reductions in the necessary memory while not producing a large overhead in time. It gets the best 
coverage results among all partitioning methods, being the only one able to improve the coverage of 
the non-partitioning version in one domain. Furthermore, of all the strategies with a fixed number of 
partitions, it is the one achieving largest memory savings and the second best in overall time. 

In the plots (Figures[6]-[8]) we provide information on the images computed during the exploration in 6 
different games of varying difficulty comparing different partitioning options. We omit the methods with 
less coverage (States Lex 100000, Nodes Shortest Path 10000 and Nodes Compress 10000) to visualize 
well the differences between the other versions. Both search directions are provided in one plot, separated 
by a vertical line. 

All the selected partitioning options fail to finish the exploration on TPeg and Asteroids Parallel, 
while they complete the search in all the other domains. In TPeg, the non-partitioning version is the one 
exploring more layers. On the other hand, in Asteroids Parallel, the lexicographic partitioning achieves 
the best results. 

Figure [6] shows the time spent in computing each layer by the different partitioning methods. In 
general, applying partitioning over the BDDs slows the image computation, except in Asteroids Parallel. 
However, the overhead does not appear prohibitive for the application of partitioning in order to reduce 
the memory requirements. The efficiency of the partitioning methods is consistent within each domain, 
across all the layers. Among the different partitioning options, the States Lex version is the closest to the 
none partitioning across the different domains, followed by Disjunctive Iterative and FoldNodes SPath 8. 

Figure [7] shows the total number of nodes involved in computing each layer, when applying each 
different method. For all partitioning methods, the sum of the nodes of each part is larger than the 
original, represented by the none partitioning version. On the other hand, the total number of nodes 
of States Lex partitioning is not lower than that of the other partitioning methods, suggesting that the 
improvements in runtime with respect other partitioning methods is thanks to the fast computation of the 
lexicographic splitting procedure. 

Figure [8] represents the maximum size of the BDDs involved in a single image, determining the 
minimum memory requirements for performing the task. In this case, applying partitioning seems to 
help in most cases. Partitionings balancing the number of nodes, FoldNodes Compress 8 and Fold Nodes 
Shortest Path 8 achieve the best performance in this metric. The lexicographic partitioning is also able 
to obtain significant reductions in the amount of required memory. 

7 Conclusions and Future Work 

In this paper we have shown an insightful approach for a balanced disjunctive image computation by 
splitting the BDD representing the state set into equally sized subsets to strongly solve games. A new 
BDD-splitting method based on a lexicographic criteria have been presented and compared to other parti- 
tionings in game solving tasks, showing that it is an effective method to reduce the memory requirements. 

Even though we concentrated on game playing in AI the methods we proposed are general and likely 
applicable to other areas, such as the formal verification of sequential circuits. In the model checking 
domain BDD partitioning techniques, e.g., by EUl [TOl . often show significant advances for executing 
the image operation, but usually provide no control on the size of the BDD, nor on the number of states 
represented. Moreover, the actual work executed for computing the image appears to correlate not only 
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with the BDDs in the input but also with the number of states represented [22]. The core advantage of 
partitioned BDDs is that one can gain depth in some of the partitions and locate errors fast. A complete 
search of such prioritization is involved, as information for backtracking has to be maintained. For this 
case lexicographic partitioning can help to gain a better search control. Some potential may be obtained 
in using different variable orderings in different partitionings, but this will make the algorithms more 
complicated. 

Previous work on improved BDD partitioning and guided exploration in AI includes work by |[9l ITTI 
that partitions the state sets along different g- and /j-values and along the difference in the /j-value. 
Using a lex-partitioning in the state sets prior and posterior to computing the image, the search can be 
distributed. Each computing node is responsible for computing images in the lexicographical window it 
is assigned to. As the images are computationally expensive, sending around the BDDs is negligible. By 
adapting the window sizes different forms of dynamic load balancing are immediate. 

Explicit search can be more space-efficient if perfect hash functions are available. With ranking and 
unranking we can eventually connect a symbolic state space representation with BDDs and an explicit 
bitvector based exploration. The BDDs can serve as a basis for a linear-time ranking and unranking 
scheme. As we have control over the number of states in a BDD, we can switch between symbolic and 
explicit state space generation when the available main memory is sufficient to cover the partitioned state 
sets. This provides a combination of the two methods, where the BDDs are used to define hash functions 
for addressing states in the bitvector representation of the state space. 

In game playing we can think of a layered approach to perform forward search with a BDD and 
retrograde analysis that changes from symbolic to explicit-state representation to strongly solve a game, 
by means that the solvability status for all states is computed. In case of exploring domains with complex 
cost functions in AI planning, a breadth-first BDD enumeration might be feasible, while computing the 
optimal cost is much harder. 

As the computation of the reachability set is done in compact form the problem of invalid (unreach- 
able) states in the backward traversal can be avoided. Due to the partial description of the goals there 
are many planning domains where the set of backward reachable states is much larger than the one in 
forward search. Consider the sliding-tiles puzzle with n tiles and with the blank position not mentioned 
in the goal state. The inverse of planning operators that move a tile has the position of the blank and the 
tile to be moved in the precondition, and the exchange of tile and blank in the effects. Since the blank 
position is not known to the planner backward exploration will generate states with tiles on top of each 
other, so that with n" the set of backward reachable states is exponentially larger than the number of n ! /2 
forward reachable states. 
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